Dependency-Based PropBanking of Clinical Finnish
Abstract
In this paper, we present a PropBank of clinical Finnish, an annotated corpus of verbal propositions and arguments. The clinical PropBank is created on top of a previously existing dependency treebank annotated in the Stanford Dependency (SD) scheme and covers 90% of all verb occurrences in the treebank. We establish that the PropBank scheme is applicable to clinical Finnish as well as compatible with the SD scheme, with an overwhelming proportion of arguments being governed by the verb. This allows argument candidates to be restricted to direct verb dependents, substantially simplifying the PropBank construction. The clinical Finnish PropBank is freely available at the address http://bionlp.utu.fi.
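The restriction of argument candidates to direct verb dependents can be illustrated with a minimal sketch. The token tuple layout, the coarse POS tag "V", and the English example sentence below are assumptions made for illustration only; they are not the treebank's actual file format, tag set, or data.

```python
from collections import defaultdict

# Hypothetical token layout: (token_id, form, head_id, deprel, pos).
# head_id 0 marks the root; deprel labels follow Stanford Dependency names.
TOKENS = [
    (1, "Patient", 2, "nsubj", "N"),
    (2, "received", 0, "root", "V"),
    (3, "new", 4, "amod", "A"),
    (4, "medication", 2, "dobj", "N"),
    (5, "yesterday", 2, "tmod", "N"),
]


def argument_candidates(tokens):
    """Map each verb to its direct dependents, the only argument candidates."""
    # Index every token under the id of its governor.
    dependents = defaultdict(list)
    for tok_id, form, head, deprel, pos in tokens:
        dependents[head].append((form, deprel))
    # Keep only the dependents of tokens tagged as verbs (assumed tag "V").
    return {
        (tok_id, form): dependents[tok_id]
        for tok_id, form, head, deprel, pos in tokens
        if pos == "V"
    }


if __name__ == "__main__":
    for verb, candidates in argument_candidates(TOKENS).items():
        print(verb, candidates)
```

In this toy sentence, "new" is not a candidate because it depends on "medication" rather than on the verb, so an annotator would only be shown the three direct dependents of "received".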
Similar resources
Towards a Dependency-based PropBank of General Finnish
In this work, we present the first results of a project aiming at a Finnish Proposition Bank, an annotated corpus of semantic roles. The annotation is based on an existing treebank of Finnish, the Turku Dependency Treebank, annotated using the well-known Stanford Dependency scheme. We describe the use of the dependency treebank for PropBanking purposes and show that both annotation layers prese...
Familial Amyloid Polyneuropathy Type IV (FINNISH) with Rapid Clinical Progression in an Iranian Woman: A Case Report
Familial amyloid polyneuropathy (FAP) type IV (FINNISH) is a rare clinical entity with challenging neuropathy and cosmetic deficits. Amyloidosis can affect peripheral sensory, motor, or autonomic nerves. Nerve lesions are induced by deposits of amyloid fibrils and treatment approaches for neuropathy are challenging. Involvement of cranial nerves and atrophy in facial muscles is a real concern i...
Dependency Annotation of Wikipedia: First Steps Towards a Finnish Treebank
In this work, we present the first results obtained during the annotation of a general Finnish treebank in the Stanford Dependency scheme. We find that the scheme is a suitable syntax representation for Finnish, with only minor modifications needed. The treebank is based on text from the Finnish Wikipedia, ensuring its free distribution and broad topical variance. To assess the suitability of W...
Parsing Clinical Finnish: Experiments with Rule-Based and Statistical Dependency Parsers
In this paper, we present a new syntactically annotated corpus consisting of daily notes from an intensive care unit in a Finnish hospital. Using the corpus, we perform experiments with both rule-based and statistical parsers. We apply an existing rule-based parser specifically developed for this clinical language and create a set of conversion rules for transforming the constituency scheme of ...
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3
Corpus-based treebank annotation is known to result in incomplete coverage of mid- and low-frequency linguistic constructions: the linguistic representation and corpus annotation quality are sometimes suboptimal. Large descriptive grammars also cover many mid- and low-frequency constructions. We argue for the use of large descriptive grammars and their sample sentences as a basis for specifying higher-...